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1 <110> APPLICANT: Zavada, Jan 

2 Pastorekova, Silvia 

3 Pastorek, Jaromir 

4 <120> TITLE OF INVENTION: MN Gene and Protein 

5 <130> PILE REFERENCE: D-0021.5 PCT 

6 <140> CURRENT APPLICATION NUMBER: US/09/807 , 94 9A 

7 <141> CURRENT FILING DATE: 2001-08-09 

8 <150> PRIOR APPLICATION NUMBER: 09/177/776 

9 <151> PRIOR FILING DATE: 1998-10-23 

10 <150> PRIOR APPLICATION NUMBER: 09/178/115 

11 <151> PRIOR FILING DATE: 1998-10-23 

12 <160> NUMBER OF SEQ ID NOS : 143 

13 <170> SOFTWARE: Patentln Ver . 2.1 

15 <210> SEQ ID NO: 1 

16 <211> LENGTH: 1522 

17 <212> TYPE: DNA 

18 <213> ORGANISM: HUMAN 

19 <220> FEATURE: 

20 <221> NAME/KEY 




CDS 

(13) . . (1389) 
mat_peptide 
(124) . . (1389) 
1 

2 5 acagtcagcc gc atg get ccc ctg tgc ccc age ccc tgg etc cct ctg ttg 51 

26 Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu 

27 -35 -30 -25 



21 <222> LOCATION 

22 <221> NAME/KEY 
2 3 <222> LOCATION 
24 <400> SEQUENCE 
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98 Asn Gly Arg Val lie Glu Ala Ser Phe Pro Ala Gly Val Asp Ser Ser 

99 345 350 355 360 

100 cct egg get get gag cca gtc cag ctg aat tec tgc ctg get get ggt 12 51 

101 Pro Arg Ala Ala Glu Pro Val Gin Leu Asn Ser Cys Leu Ala Ala Gly 

102 365 370 375 

103 gac ate eta gee ctg gtt ttt ggc etc ctt ttt get gtc acc age gtc 1299 

104 Asp lie Leu Ala Leu Val Phe Gly Leu Leu Phe Ala Val Thr Ser Val 

105 380 385 390 

106 gcg ttc ctt gtg cag atg aga agg cag cac aga agg gga acc aaa ggg 1347 

107 Ala Phe Leu Val Gin Met Arg Arg Gin His Arg Arg Gly Thr Lys Gly 

108 395 400 405 

109 ggt gtg age tac cgc cca gca gag gta gee gag act gga gec 1389 

110 Gly Val Ser Tyr Arg Pro Ala Glu Val Ala Glu Thr Gly Ala 

111 410 415 420 

112 tagaggctgg atcttggaga atgtgagaag ccagccagag gcatctgagg gggagccggt 14 4 9 

113 aactgtcctg tcctgctcat tatgecaett ccttttaact gecaagaaat tttttaaaat 1509 

114 aaatatttat aat 1522 



116 <210> SEQ ID NO: 2 

117 <211> LENGTH: 459 

118 <212> TYPE: PRT 

119 <213> ORGANISM: HUMAN 

120 <400> SEQUENCE: 2 



• 121 Met Ala Pro Leu Cys Pro Ser Pro Trp Leu Pro Leu Leu lie Pro Ala 

122 -35 -30 -25 

12 3 Pro Ala Pro Gly Leu Thr Val Gin Leu Leu Leu Ser Leu Leu Leu Leu 

124 -20 -15 -10 

12 5 Met Pro Val His Pro Gin Arg Leu Pro Arg Met Gin Glu Asp Ser Pro 

126 -5 -11 5 10 

127 Leu Gly Gly Gly Ser Ser Gly Glu Asp Asp Pro Leu Gly Glu Glu Asp 

128 15 20 25 

12 9 Leu Pro Ser Glu Glu Asp Ser Pro Arg Glu Glu Asp Pro Pro Gly Glu 

130 30 35 40 

131 Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro Gly Glu Glu Asp Leu Pro 

132 45 50 55 

133 Glu Val Lys Pro Lys Ser Glu Glu Glu Gly Ser Leu Lys Leu Glu Asp 

134 60 65 70 75 

135 Leu Pro Thr Val Glu Ala Pro Gly Asp Pro Gin Glu Pro Gin Asn Asn 

136 80 85 90 

137 Ala His Arg Asp Lys Glu Gly Asp Asp Gin Ser His Trp Arg Tyr Gly 

138 95 100 105 

139 Gly Asp Pro Pro Trp Pro Arg Val Ser Pro Ala Cys Ala Gly Arg Phe 

140 110 115 120 

141 Gin Ser Pro Val Asp lie Arg Pro Gin Leu Ala Ala Phe Cys Pro Ala 

142 125 130 135 

14 3 Leu Arg Pro Leu Glu Leu Leu Gly Phe Gin Leu Pro Pro Leu Pro Glu 

144 140 145 150 155 

14 5 Leu Arg Leu Arg Asn Asn Gly His Ser Val Gin Leu Thr Leu Pro Pro 

146 160 165 170 

147 Gly Leu Glu Met Ala Leu Gly Pro Gly Arg Glu Tyr Arg Ala Leu Gin 
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148 175 180 185 

149 Leu His Leu His Trp Gly Ala Ala Gly Arg Pro Gly Ser Glu His Thr 

150 190 195 200 

151 Val Glu Gly His Arg Phe Pro Ala Glu lie His Val Val His Leu Ser 

152 205 210 215 

153 Thr Ala Phe Ala Arg Val Asp Glu Ala Leu Gly Arg Pro Gly Gly Leu 

154 220 225 230 235 

155 Ala Val Leu Ala Ala Phe Leu Glu Glu Gly Pro Glu Glu Asn Ser Ala 

156 240 245 250 

157 Tyr Glu Gin Leu Leu Ser Arg Leu Glu Glu lie Ala Glu Glu Gly Ser 

158 255 260 265 

159 Glu Thr Gin Val Pro Gly Leu Asp lie Ser Ala Leu Leu Pro Ser Asp 

160 270 275 280 

161 Phe Ser Arg Tyr Phe Gin Tyr Glu Gly Ser Leu Thr Thr Pro Pro Cys 

162 285 290 295 

16 3 Ala Gin Gly Val lie Trp Thr Val Phe Asn Gin Thr Val Met Leu Ser 

164 300 305 310 315 

16 5 Ala Lys Gin Leu His Thr Leu Ser Asp Thr Leu Trp Gly Pro Gly Asp 

166 320 325 330 

167 Ser Arg Leu Gin Leu Asn Phe Arg Ala Thr Gin Pro Leu Asn Gly Arg 

168 335 340 345 

169 Val lie Glu Ala Ser Phe Pro Ala Gly Val Asp Ser Ser Pro Arg Ala 

170 350 355 360 

171 Ala Glu Pro Val Gin Leu Asn Ser Cys Leu Ala Ala Gly Asp lie Leu 

172 365 370 375 

173 Ala Leu Val Phe Gly Leu Leu Phe Ala Val Thr Ser Val Ala Phe Leu 

174 380 385 390 395 

175 Val Gin Met Arg Arg Gin His Arg Arg Gly Thr Lys Gly Gly Val Ser 

176 400 405 410 

177 Tyr Arg Pro Ala Glu Val Ala Glu Thr Gly Ala 

178 415 420 



180 <210> SEQ ID NO: 3 

181 <211> LENGTH: 29 

182 <212> TYPE: DNA 

183 <213> ORGANISM: HUMAN 

184 <400> SEQUENCE: 3 

185 cgcccagtgg gtcatcttcc ccagaagag 2 9 

187 <210> SEQ ID NO: 4 

188 <211> LENGTH: 19 

189 <212> TYPE: DNA 

190 <213> ORGANISM: HUMAN 

191 <400> SEQUENCE: 4 

192 ggaatcctcc tgcatccgg 19 

194 <210> SEQ ID NO: 5 

195 <211> LENGTH: 10898 

196 <212> TYPE: DNA 

197 <213> ORGANISM: HUMAN 

198 <220> FEATURE: 

199 <221> NAME/KEY: gene 
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200 <222> LOCATION: (1).. (10898) 

201 <22 3> OTHER INFORMATION: full-length MN genomic sequence 
W--> 202 <221> NAME/KEY: unsure of base at position 1974 

203 <222> LOCATION: (1974) 

204 <223> OTHER INFORMATION: unsure of base at position 1974 , which is in the 5' region 

transcription initiation site (3507) as determined by RNase protection assay. 
SEQUENCE : 5 

ggatcctgtt gactcgtgac cttaccccca accctgtgct ctctgaaaca tgagctgtgt 6 0 

ccactcaggg ttaaatggat taagggcggt gcaagatgtg ctttgttaaa cagatgcttg 120 

aaggcagcat gctcgttaag agtcatcacc aatccctaat ctcaagtaat cagggacaca 180 

aacactgcgg aaggccgcag ggtcctctgc ctaggaaaac cagagacctt tgttcacttg 240 

tttatctgac cttccctcca ctattgtcca tgaccctgcc aaatccccct ctgtgagaaa 300 

cacccaagaa ttatcaataa aaaaataaat ttaaaaaaaa aatacaaaaa aaaaaaaaaa 360 

aaaaaaaaaa gacttacgaa tagttattga taaatgaata gctattggta aagccaagta 4 20 

aatgatcata ttcaaaacca gacggccatc atcacagctc aagtctacct gatttgatct 4 80 

ctttatcatt gtcattcttt ggattcacta gattagtcat catcctcaaa attctccccc 540 

aagttctaat tacgttccaa acatttaggg gttacatgaa gcttgaacct actaccttct 6 00 

ttgcttttga gccatgagtt gtaggaatga tgagtttaca ccttacatgc tggggattaa 6 60 

tttaaacttt acctctaagt cagttgggta gcctttggct tatttttgta gctaattttg 720 

tagttaatgg atgcactgtg aatcttgcta tgatagtttt cctccacact ttgccactag 780 

gggtaggtag gtactcagtt ttcagtaatt gcttacctaa gaccctaagc cctatttctc 840 

ttgtactggc ctttatctgt aatatgggca tatttaatac aatataattt ttggagtttt 900 

tttgtttgtt tgtttgtttg tttttttgag acggagtctt gcatctgtca tgcccaggct 960 

ggagtagcag tggtgccatc tcggctcact gcaagctcca cctcccgagt tcacgccatt 1020 

ttcctgcctc agcctcccga gtagctggga ctacaggcgc ccgccaccat gcccggctaa 1080 

ttttttgtat ttttggtaga gacggggttt caccgtgtta gccagaatgg tctcgatctc 114 0 

ctgacttcgt gatccacccg cctcggcctc ccaaagttct gggattacag gtgtgagcca 1200 

ccgcacctgg ccaatttttt gagtctttta aagtaaaaat atgtcttgta agctggtaac 1260 

tatggtacat ttccttttat taatgtggtg ctgacggtca tataggttct tttgagtttg 1320 

gcatgcatat gctacttttt gcagtccttt cattacattt ttctctcttc atttgaagag 1380 

catgttatat cttttagctt cacttggctt aaaaggttct ctcattagcc taacacagtg 1440 

tcattgttgg taccacttgg atcataagtg gaaaaacagt caagaaattg cacagtaata 1500 

cttgtttgta agagggatga ttcaggtgaa tctgacacta agaaactccc ctacctgagg 1560 

tctgagattc ctctgacatt gctgtatata ggcttttcct ttgacagcct gtgactgcgg 1620 

actatttttc ttaagcaaga tatgctaaag ttttgtgagc ctttttccag agagaggtct 1680 

catatctgca tcaagtgaga acatataatg tctgcatgtt tccatatttc aggaatgttt 1740 

gcttgtgttt tatgctttta tatagacagg gaaacttgtt cctcagtgac ccaaaagagg 1800 

tgggaattgt tattggatat catcattggc ccacgctttc tgaccttgga aacaattaag 1860 

ggttcataat ctcaattctg tcagaattgg tacaagaaat agctgctatg tttcttgaca 1920 

ttccacttgg taggaaataa gaatgtgaaa ctcttcagtt ggtgtgtgtc cctngttttt 1980 

ttgcaatttc cttcttactg tgttaaaaaa aagtatgatc ttgctctgag aggtgaggca 2040 

ttcttaatca tgatctttaa agatcaataa tataatcctt tcaaggatta tgtctttatt 2100 

ataataaaga taatttgtct ttaacagaat caataatata atcccttaaa ggattatatc 2160 

tttgctgggc gcagtggctc acacctgtaa tcccagcact ttgggtggcc aaggtggaag 2220 

gatcaaattt gcctacttct atattatctt ctaaagcaga attcatctct cttccctcaa 2280 

tatgatgata ttgacagggt ttgccctcac tcactagatt gtgagctcct gctcagggca 2 34 0 

ggtagcgttt tttgtttttg tttttgtttt tcttttttga gacagggtct tgctctgtca 2400 

cccaggccag agtgcaatgg tacagtctca gctcactgca gcctcaaccg cctcggctca 2460 

aaccatcatc ccatttcagc ctcctgagta gctgggacta caggcacatg ccattacacc 2520 



flanking 


the 


205 




206 


<400> 


207 




208 




209 




ft 

210 




211 




212 




213 




214 




215 




216 




217 




218 




219 




220 




221 




222 




223 




224 




225 




226 




227 




228 




229 




2 30 




2 31 




232 




233 




234 




235 




236 




237 




238 




W--> 239 




240 




241 




242 




243 




244 




245 




246 




247 




248 





nit-f n c /or X3a has been detected m the Sequence U$t\n« 
Review the Sequence Listing ro insure a corresponding 8 
expatriation is presented in the <220> to <U3> fields of 
each sequence using n or Xaa, - - 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/8 07 , 949A 



DATE : 10/17/2001 
TIME: 12:15:09 



L : 


202 


M: 257 W: 


L : 


239 


M : 341 W : 


L : 


424 


M : 2 57 W : 


L: 


569 


M: 257 W: 


L: 


574 


M:341 W: 


L: 


586 


M:257 W: 


L: 


591 


M:341 W: 


L: 


1119 


M:259 W 


L: 


1120 


M:257 W 


L: 


1126 


M:259 W 


L: 


1160 


M:341 W 


L: 


1469 


M:257 W 


L: 


1475 


M:257 W 


L: 


1516 


M:341 W 


L: 


1723 


M:257 W 


L: 


1740 


M:341 W 



Input Set : N:\Crf3\10162001\l807949.raw 
Output Set: N:\CRF3\10172001\I807949A.raw 

Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID# : 5 
(46) "n" or "Xaa" used, for SEQ ID#:5 

Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID# : 9 
Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:25 
(46) "n" or "Xaa" used, for SEQ ID#:25 

Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:26 

(46) "n" or "Xaa" used, for SEQ ID#:26 
: Allowed number of lines exceeded, <223> Other Information: 
: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID# : 58 
: Allowed number of lines exceeded, <223> Other Information: 
: (46) "n" or "Xaa" used, for SEQ ID#:58 

: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:90 
: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:90 
: (46) n n n or "Xaa" used, for SEQ ID#:90 

: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:110 
: (46) n n n or "Xaa" used, for SEQ ID#:110 
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STATISTICS SUMMARY DATE: 10/17/2001 

PATENT APPLICATION: US/09/807 , 949A TIME: 12:15:09 

Input Set : N:\Crf3\10162001\I807949.raw 
Output Set: N:\CRF3\10172001\I807949A.raw 

Application Serial Number: US/09/807 , 949A 

Alpha or Numeric: Numeric 

Application Class: 

Application File Date: 08-09-2001 

Art Unit: PCT09 

Software Application: Patentln 
Total Number of Sequences: 143 
Total Nucleotides: 33606 
Total Amino Acids: 1828 
Number of Errors : 0 
Number of Warnings: 16 
Number of Corrections : 0 

MESSAGE SUMMARY 

257 W: 8 (Feature value mis-spelled or invalid) 
259 W: 2 (Allowed number of lines exceeded) 
341 W: 6 ((46) "n" or "Xaa" used) 
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